Midbrain Dopamine Neurons Encode a Quantitative Reward Prediction Error Signal

نویسندگان

  • Hannah M. Bayer
  • Paul W. Glimcher
چکیده

The midbrain dopamine neurons are hypothesized to provide a physiological correlate of the reward prediction error signal required by current models of reinforcement learning. We examined the activity of single dopamine neurons during a task in which subjects learned by trial and error when to make an eye movement for a juice reward. We found that these neurons encoded the difference between the current reward and a weighted average of previous rewards, a reward prediction error, but only for outcomes that were better than expected. Thus, the firing rate of midbrain dopamine neurons is quantitatively predicted by theoretical descriptions of the reward prediction error signal used in reinforcement learning models for circumstances in which this signal has a positive value. We also found that the dopamine system continued to compute the reward prediction error even when the behavioral policy of the animal was only weakly influenced by this computation.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Systems Neuroscience: Shaping the Reward Prediction Error Signal

A recent study shows that midbrain GABA (inhibitory) neurons code for environmentally predicted rewards. These GABA neurons communicate with dopamine neurons, where the reward prediction is subtracted from delivered reward. Thus, the GABA prediction signal shapes the dopamine reward prediction error signal.

متن کامل

The Dopamine Prediction Error: Contributions to Associative Models of Reward Learning

Phasic activity of midbrain dopamine neurons is currently thought to encapsulate the prediction-error signal described in Sutton and Barto's (1981) model-free reinforcement learning algorithm. This phasic signal is thought to contain information about the quantitative value of reward, which transfers to the reward-predictive cue after learning. This is argued to endow the reward-predictive cue ...

متن کامل

Reward prediction error computation in the pedunculopontine tegmental nucleus neurons.

In this article, we address the role of neuronal activity in the pathways of the brainstem-midbrain circuit in reward and the basis for believing that this circuit provides advantages over previous reinforcement learning theory. Several lines of evidence support the reward-based learning theory proposing that midbrain dopamine (DA) neurons send a teaching signal (the reward prediction error sig...

متن کامل

Dopamine reward prediction error coding

Reward prediction errors consist of the differences between received and predicted rewards. They are crucial for basic forms of learning about rewards and make us strive for more rewards-an evolutionary beneficial trait. Most dopamine neurons in the midbrain of humans, monkeys, and rodents signal a reward prediction error; they are activated by more reward than predicted (positive prediction er...

متن کامل

Reward prediction error signals by reticular formation neurons.

As a key part of the brain's reward system, midbrain dopamine neurons are thought to generate signals that reflect errors in the prediction of reward. However, recent evidence suggests that "upstream" brain areas may make important contributions to the generation of prediction error signals. To address this issue, we recorded neural activity in midbrain reticular formation (MRNm) while rats per...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Neuron

دوره 47  شماره 

صفحات  -

تاریخ انتشار 2005